AITopics | Basra

Basra

Hotel in Iraqi capital Baghdad struck as attacks on US embassy intercepted

Al Jazeera | Mar-16-2026, 23:00:11 GMT

Drone strike hits Al-Rasheed hotel in Baghdad's Green Zone near US embassy, no casualties reported.

A prominent hotel in central Baghdad's heavily fortified Green Zone was struck by a drone, amid reports that Iraqi air defences intercepted an attack over the United States Embassy. The strike on Monday evening hit the top floor of Al-Rasheed Hotel, causing damage but no casualties, according to two Iraqi security officials cited by The Associated Press (AP) news agency. Security sources told the Reuters news agency that two Katyusha rockets had been intercepted that evening near the US Embassy in the Green Zone, which houses diplomatic missions as well as international institutions and government offices. Earlier Monday, the Iran-backed Kataib Hezbollah announced that Abu Ali Al-Askari, a prominent security official with the paramilitary group, had been killed, without giving details on the circumstances.

artificial intelligence, live navigation menu news show, news section africa asia us, (7 more...)

Al Jazeera

Country:

North America > United States (1.00)
Asia > Middle East > Iraq > Baghdad Governorate > Baghdad (0.86)
Asia > Middle East > Iran (0.67)
(12 more...)

Industry:

Government > Regional Government > North America Government > United States Government (1.00)
Government > Military (1.00)
Government > Foreign Policy (1.00)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (0.36)

US warns Iraq must act against Iran-backed militia attacks on American assets

FOX News | Mar-16-2026, 12:27:12 GMT

Iraq's Prime Minister Mohammed Shia al-Sudani faces pressure to act against Iran-backed terrorist groups following increased attacks on U.S., European, and Kurdish assets in the country.

artificial intelligence, government, social media, (16 more...)

FOX News

Country:

Asia > Middle East > Iran (1.00)
Asia > Middle East > Iraq > Kurdistan Region (0.17)
Asia > North Korea (0.14)
(19 more...)

Industry:

Media > News (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Government > Regional Government > Asia Government > Middle East Government > Iraq Government (0.70)

Technology:

Information Technology > Communications > Social Media (0.98)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (0.46)

Language Model Tokenizers Introduce Unfairness Between Languages

Neural Information Processing Systems | Feb-14-2026, 13:54:12 GMT

Recent language models have shown impressive multilingual performance, even when not explicitly trained for it. Despite this, there are concerns about the quality of their outputs across different languages. In this paper, we show how disparity in the treatment of different languages arises at the tokenization stage, well before a model is even invoked. The same text translated into different languages can have drastically different tokenization lengths, with differences up to 15 times in some cases. These disparities persist even for tokenizers that are intentionally trained for multilingual support.
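The length disparity described in the abstract can be illustrated with a minimal sketch. It assumes a byte-level tokenizer with no learned merges, so the token count is just the UTF-8 byte count; this is a stand-in chosen for illustration, not one of the tokenizers evaluated in the paper. The sample phrases and the helper names `byte_token_count` and `length_ratios` are hypothetical.

```python
# Assumption: one token per UTF-8 byte, as in a byte-level tokenizer without merges.
# Real subword tokenizers will give different counts; only the comparison idea carries over.
samples = {
    "English": "thank you",
    "Russian": "спасибо",
    "Japanese": "ありがとう",
    "Hindi": "धन्यवाद",
}


def byte_token_count(text: str) -> int:
    """Number of tokens under the byte-level assumption."""
    return len(text.encode("utf-8"))


def length_ratios(texts: dict, reference: str = "English") -> dict:
    """Token count of each text divided by the token count of the reference text."""
    base = byte_token_count(texts[reference])
    return {lang: byte_token_count(text) / base for lang, text in texts.items()}


ratios = length_ratios(samples)
for lang, ratio in sorted(ratios.items(), key=lambda item: item[1]):
    print(f"{lang:<9}{ratio:.2f}x the English length")
```

Scripts whose characters need two or three bytes each come out longer than the English phrase for the same meaning, which is the kind of gap the abstract refers to.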

large language model, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Country:

North America > Haiti (0.14)
Asia > Philippines > Luzon > Ilocos Region > Province of Pangasinan (0.04)
Europe > Switzerland > Zürich > Zürich (0.04)
(38 more...)

Genre: Overview (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.70)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.68)

74bb24dca8334adce292883b4b651eda-Paper-Conference.pdf

Neural Information Processing Systems | Oct-8-2025, 22:13:22 GMT

large language model, machine learning, natural language, (17 more...)

Neural Information Processing Systems

Country:

North America > Haiti (0.14)
Asia > Philippines > Luzon > Ilocos Region > Province of Pangasinan (0.04)
Europe > Switzerland > Zürich > Zürich (0.04)
(38 more...)

Genre: Overview (0.46)

Technology:

Information Technology > Communications (0.93)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.70)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.69)
(2 more...)

Learning to Reason as Action Abstractions with Scalable Mid-Training RL

Zhang, Shenao, Yu, Donghan, Feng, Yihao, Jin, Bowen, Wang, Zhaoran, Peebles, John, Wang, Zirui

arXiv.org Machine Learning | Oct-7-2025

Large language models excel with reinforcement learning (RL), but fully unlocking this potential requires a mid-training stage. An effective mid-training phase should identify a compact set of useful actions and enable fast selection among them through online RL. We formalize this intuition by presenting the first theoretical result on how mid-training shapes post-training: it characterizes an action subspace that minimizes both the value approximation error from pruning and the RL error during subsequent planning. Our analysis reveals two key determinants of mid-training effectiveness: pruning efficiency, which shapes the prior of the initial RL policy, and its impact on RL convergence, which governs the extent to which that policy can be improved via online interactions. These results suggest that mid-training is most effective when the decision space is compact and the effective horizon is short, highlighting the importance of operating in the space of action abstractions rather than primitive actions. Building on these insights, we propose Reasoning as Action Abstractions (RA3), a scalable mid-training algorithm. Specifically, we derive a sequential variational lower bound and optimize it by iteratively discovering temporally-consistent latent structures via RL, followed by fine-tuning on the bootstrapped data. Experiments on code generation tasks demonstrate the effectiveness of our approach. Across multiple base models, RA3 improves the average performance on HumanEval and MBPP by 8 and 4 points over the base model and the next-token prediction baseline. Furthermore, RA3 achieves faster convergence and higher asymptotic performance in RLVR on HumanEval+, MBPP+, LiveCodeBench, and Codeforces.

abstraction, arxiv preprint arxiv, reasoning, (14 more...)

arXiv.org Machine Learning

2509.2581

Country:

North America > United States > Massachusetts > Hampshire County > Amherst (0.04)
Asia > Middle East > Jordan (0.04)
Asia > Middle East > Iraq > Basra Governorate > Basra (0.04)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.89)

Understanding Outer Optimizers in Local SGD: Learning Rates, Momentum, and Acceleration

Khaled, Ahmed, Kale, Satyen, Douillard, Arthur, Jin, Chi, Fergus, Rob, Zaheer, Manzil

arXiv.org Machine Learning | Sep-15-2025

Modern machine learning often requires training with large batch sizes, distributed data, and massively parallel compute hardware (like mobile and other edge devices or distributed data centers). Communication becomes a major bottleneck in such settings, but methods like Local Stochastic Gradient Descent (Local SGD) show great promise in reducing this additional communication overhead. Local SGD consists of three parts: a local optimization process, an aggregation mechanism, and an outer optimizer that uses the aggregated updates from the nodes to produce a new model. While there exists an extensive literature on understanding the impact of hyperparameters in the local optimization process, the choice of outer optimizer and its hyperparameters is less clear. We study the role of the outer optimizer in Local SGD, and prove new convergence guarantees for the algorithm. In particular, we show that tuning the outer learning rate allows us to (a) trade off between optimization error and stochastic gradient noise variance, and (b) make up for ill-tuning of the inner learning rate. Our theory suggests that the outer learning rate should sometimes be set to values greater than 1. We extend our results to settings where we use momentum in the outer optimizer, and we show a similar role for the momentum-adjusted outer learning rate. We also study acceleration in the outer optimizer and show that it improves the convergence rate as a function of the number of communication rounds, improving upon the convergence rate of prior algorithms that apply acceleration locally. Finally, we also introduce a novel data-dependent analysis of Local SGD that yields further insights on outer learning rate tuning. We conduct comprehensive experiments with standard language models and various outer optimizers to validate our theory.
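The three parts named in the abstract (local optimization, aggregation, outer optimizer) can be sketched on a toy problem. This is a minimal illustration under stated assumptions, not the paper's algorithm or experimental setup: each worker i holds a hypothetical scalar objective f_i(x) = 0.5 * (x - c_i)^2, the outer optimizer is a plain step with no momentum or acceleration, and the function name `local_sgd` and all parameter names are made up for this example.

```python
import random


def local_sgd(centers, rounds=10, local_steps=5, inner_lr=0.1,
              outer_lr=1.0, noise=0.0, seed=0):
    """Toy Local SGD on worker objectives f_i(x) = 0.5 * (x - c_i)^2."""
    rng = random.Random(seed)
    x = 0.0  # shared model, a single scalar in this sketch
    for _ in range(rounds):
        deltas = []
        for c in centers:
            # 1) Local optimization: every worker starts from the shared model.
            y = x
            for _ in range(local_steps):
                grad = (y - c) + rng.gauss(0.0, noise)  # optional gradient noise
                y -= inner_lr * grad
            deltas.append(y - x)
        # 2) Aggregation: average the workers' model updates.
        avg_delta = sum(deltas) / len(deltas)
        # 3) Outer optimizer: a plain step scaled by the outer learning rate.
        x += outer_lr * avg_delta
    return x


centers = [1.0, 3.0]                      # the average objective is minimized at 2.0
x_plain = local_sgd(centers, outer_lr=1.0)
x_large = local_sgd(centers, outer_lr=2.0)
print(abs(x_plain - 2.0), abs(x_large - 2.0))
```

In this noise-free toy setting with a deliberately small inner learning rate, the run with an outer learning rate of 2 ends closer to the minimizer than the run with 1, which is in the spirit of point (b) in the abstract. With `noise > 0` the trade-off in point (a) comes into play and larger outer steps are not always better.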

equation, local sgd, theorem 3, (10 more...)

arXiv.org Machine Learning

2509.10439

Country:

North America > United States > Virginia (0.04)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
Asia > Middle East > Iraq > Basra Governorate > Basra (0.04)
(6 more...)

Genre: Research Report (0.70)

Industry: Information Technology > Services (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.89)

A Single Merging Suffices: Recovering Server-based Learning Performance in Decentralized Learning

Zhu, Tongtian, Zhang, Tianyu, Wang, Mingze, Zhou, Zhanpeng, Wang, Can

arXiv.org Machine Learning | Jul-10-2025

Decentralized learning provides a scalable alternative to traditional parameter-server-based training, yet its performance is often hindered by limited peer-to-peer communication. In this paper, we study how communication should be scheduled over time, including determining when and how frequently devices synchronize. Our empirical results show that concentrating communication budgets in the later stages of decentralized training markedly improves global generalization. Surprisingly, we uncover that fully connected communication at the final step, implemented by a single global merging, is sufficient to match the performance of server-based training. We further show that low communication in decentralized learning preserves the mergeability of local models throughout training. Our theoretical contributions, which explain these phenomena, are the first to establish that the globally merged model of decentralized SGD can converge faster than centralized mini-batch SGD. Technically, we reinterpret part of the discrepancy among local models, which was previously considered detrimental noise, as constructive components that accelerate convergence. This work challenges the common belief that decentralized learning generalizes poorly under data heterogeneity and limited communication, while offering new insights into model merging and neural network loss landscapes.
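The schedule described in the abstract, sparse peer-to-peer communication during training followed by a single global merging at the end, can be sketched on a toy problem. The sketch assumes hypothetical scalar objectives f_i(x) = 0.5 * (x - c_i)^2 on a ring of workers with uniform neighbor averaging; the function and parameter names are invented for illustration, and a convex toy like this does not reproduce the paper's neural network results.

```python
def decentralized_sgd_with_final_merge(centers, steps=100, lr=0.1, gossip_every=10):
    """Toy decentralized SGD on a ring, ending with one global merge.

    Worker i minimizes f_i(x) = 0.5 * (x - c_i)^2. Set gossip_every=0 to
    disable peer-to-peer communication entirely.
    """
    n = len(centers)
    models = [0.0] * n
    for t in range(1, steps + 1):
        # Local gradient step on each worker's own objective.
        models = [x - lr * (x - c) for x, c in zip(models, centers)]
        # Sparse peer-to-peer communication: average with the two ring neighbors.
        if gossip_every and t % gossip_every == 0:
            models = [
                (models[i - 1] + models[i] + models[(i + 1) % n]) / 3.0
                for i in range(n)
            ]
    # Single global merging: average all local models once, at the final step.
    merged = sum(models) / n
    return models, merged


centers = [1.0, 2.0, 3.0, 6.0]            # the average objective is minimized at 3.0
models, merged = decentralized_sgd_with_final_merge(centers)
print("local models:", [round(m, 3) for m in models])
print("merged model:", round(merged, 3))
```

In this toy setting the local models stay spread out because communication is rare, yet their average lands near the minimizer of the average objective, so the one merge at the end recovers what the individual workers miss.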

communication, machine learning, natural language, (14 more...)

arXiv.org Machine Learning

2507.06542

Country:

North America > United States > California > Monterey County > Monterey (0.04)
North America > Canada > Quebec > Montreal (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
(2 more...)

Genre: Research Report > New Finding (1.00)

Industry: Information Technology (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.93)

DICE: Data Influence Cascade in Decentralized Learning

Zhu, Tongtian, Li, Wenhao, Wang, Can, He, Fengxiang

arXiv.org Machine Learning | Jul-10-2025

Decentralized learning offers a promising approach to crowdsourcing data consumption and computational workloads across geographically distributed compute resources interconnected through peer-to-peer networks, accommodating exponentially increasing demands. However, proper incentives are still lacking, which considerably discourages participation. Our vision is that a fair incentive mechanism relies on fair attribution of contributions to participating nodes, which faces non-trivial challenges because localized connections make influence "cascade" in a decentralized network. To overcome this, we design the first method to estimate Data Influence CascadE (DICE) in a decentralized environment. Theoretically, the framework derives tractable approximations of influence cascade over arbitrary neighbor hops, suggesting the influence cascade is determined by an interplay of data, communication topology, and the curvature of the loss landscape. DICE also lays the foundations for applications including selecting suitable collaborators and identifying malicious behaviors. The project page is available at https://raiden-zhu.github.io/blog/2025/DICE/.

data mining, large language model, machine learning, (18 more...)

arXiv.org Machine Learning

2507.06931

Country:

Asia > Middle East > Jordan (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Europe > Russia > Central Federal District > Moscow Oblast > Moscow (0.04)
(3 more...)

Genre:

Overview (1.00)
Research Report > Promising Solution (0.34)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(5 more...)

Integrating Vehicle Acoustic Data for Enhanced Urban Traffic Management: A Study on Speed Classification in Suzhou

Fan, Pengfei, Zhang, Yuli, Wang, Xinheng, Jiang, Ruiyuan, Gu, Hankang, Jia, Dongyao, Wang, Shangbo

arXiv.org Artificial Intelligence | Jun-27-2025

This study presents and publicly releases the Suzhou Urban Road Acoustic Dataset (SZUR-Acoustic Dataset), which is accompanied by comprehensive data-acquisition protocols and annotation guidelines to ensure transparency and reproducibility of the experimental workflow. To model the coupling between vehicular noise and driving speed, we propose a bimodal-feature-fusion deep convolutional neural network (BMCNN). During preprocessing, an adaptive denoising and normalization strategy is applied to suppress environmental background interference; in the network architecture, parallel branches extract Mel-frequency cepstral coefficients (MFCCs) and wavelet-packet energy features, which are subsequently fused via a cross-modal attention mechanism in the intermediate feature space to fully exploit time-frequency information. Experimental results demonstrate that BMCNN achieves a classification accuracy of 87.56% on the SZUR-Acoustic Dataset and 96.28% on the public IDMT-Traffic dataset. Ablation studies and robustness tests on the Suzhou dataset further validate the contributions of each module to performance improvement and overfitting mitigation. The proposed acoustics-based speed classification method can be integrated into smart-city traffic management systems for real-time noise monitoring and speed estimation, thereby optimizing traffic flow control, reducing roadside noise pollution, and supporting sustainable urban planning.

accuracy, artificial intelligence, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2506.21269

Country:

South America > Chile (0.04)
Europe > United Kingdom (0.04)
Asia > Middle East > Iraq > Basra Governorate > Basra (0.04)
(3 more...)

Genre:

Overview (1.00)
Research Report > New Finding (0.66)

Industry: Transportation > Ground > Road (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

How 3D-printed guns are spreading online

BBC News | Jun-19-2025, 00:17:34 GMT

We did not proceed with the transaction to test Jessy's claims. While his casual attitude suggested he might have been a scammer, his ability to advertise on Meta and operate on Telegram highlights apparent loopholes that real gun dealers could exploit. When contacted, Meta told the BBC that the adverts we highlighted had been "automatically disabled in line with our policies", and that inclusion in its ad library "doesn't necessarily mean the ad is still live or visible". Telegram said that Jessy's account had been proactively removed for breaching its policies. A spokesperson added: "The sale of weapons is explicitly forbidden by Telegram's terms of service and is removed whenever discovered. Moderators empowered with custom AI and machine learning tools proactively monitor public parts of the platform and accept reports in order to remove millions of pieces of harmful content each day, including the sale of weapons."

3d-printed gun, machine learning, social media, (5 more...)

BBC News

Country:

North America > United States (0.18)
Asia > Myanmar (0.06)
Asia > Middle East > Iraq > Basra Governorate > Basra (0.06)

Industry:

Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.38)
Law (0.35)
Education > Health & Safety > School Safety & Security > School Violence (0.35)

Technology:

Information Technology > Communications > Social Media (0.83)
Information Technology > Artificial Intelligence > Machine Learning (0.58)
